Using Sequence Package Analysis as a New Natural Language Understanding Method for Mining Government Recordings of Terror Suspects

Author

  • Amy Neustein
Abstract

Three years after 9/11, the Justice Department made the astounding revelation that more than 120,000 hours of potentially valuable terrorism-related recordings had yet to be transcribed. Clearly, the government’s efforts to obtain such recordings have continued. Yet there is no evidence that the contents of the recorded calls have been analyzed any more efficiently. Perhaps analysis by conventional means would be of limited value in any event. After all, terror suspects tend to avoid words that might alarm intelligence agents, thus “outsmarting” conventional mining programs, which rely heavily on word-spotting techniques. One solution is the application of a new natural language understanding method, known as Sequence Package Analysis, which can transcend the limitations of basic parsing methods by mapping out the generic conversational sequence patterns found in the dialog. The purpose of this paper is to show how this new method can efficiently mine a large volume of government recordings of the conversations of terror suspects, with the goal of reducing the backlog of unanalyzed calls.

Similar resources

Sequence Package Analysis: A New Natural Language Understanding Method for Performing Data Mining of Help-Line Calls and Doctor-Patient Interviews

Designers of audio mining programs must confront the complexities of natural language dialog, which is replete with ambiguities, circumlocutions and ellipses. Speakers often make requests, lodge complaints, or report on problems in such roundabout ways that attempts to find a statistically probable word match between the application vocabulary and the user’s speech can yield unsatisfactory resu...

Sequence Package Analysis: A New Natural Language Method for Mining User-Generated Content for Mobile Uses

A. Neustein and J.A. Markowitz (eds.), Mobile Speech and Advanced Natural Language Solutions, DOI 10.1007/978-1-4614-6018-3_5, © Springer Science+Business Media New York 2013. Abstract: Paradoxically, in an era when cyber-postings proliferate on the Web, much of the valuable information that can be mined from user-generated content (UGC) still eludes most mining programs. One reason this massive ...

Sequence Package Analysis: A New Method for Intelligent Mining of Patient Dialog, Blogs and Help-line Calls

The ambiguities, repetitions and ellipses commonly found in natural language dialog continue to hinder speech (and text) analytic mining programs that glean business intelligence data from consumer help-line calls, or extract important medical diagnostic information from doctor-patient interviews or consumer-generated health-related blogs. This poses an even greater problem when such mining pro...

Sequence Package Analysis and Soft Computing: Introducing a New Hybrid Method to Adjust to the Fluid and Dynamic Nature of Human Speech

At Linguistic Technology Systems, we are using Sequence Package Analysis (SPA) to architect a new, pragmatically-based part of speech tagging program to better conform to the fluidity and dynamism of human speech. This would allow natural language-driven voice user interfaces and audio mining programs – for use in both commercial and government applications – to adapt to the in situ constructio...

Plagiarism checker for Persian (PCP) texts using hash-based tree representative fingerprinting

With due respect to the authors’ rights, plagiarism detection is one of the critical problems in the field of text mining that many researchers are interested in. This issue is considered a serious one in higher academic institutions. There exist language-free tools which do not yield any reliable results since the special features of every language are ignored in them. Considering the paucit...

Publication date: 2006